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APPARATUS AND METHOD FOR MONITORING SYSTEM STATUS IN AN EMBEDDED 

SYSTEM 

BACKGROUND OF THE INVENTION 

[0001] Embedded systems, e.g. computers that are components in larger systems and rely on 
their own microprocessor, are becoming commonplace. For example, embedded systems are 
being used in personal electronic items such as PDAs, inkjet printers, cell phones, and car 
radios. Embedded systems are also becoming critical components of many industrial devices 
such as test and measurement systems, including the AGILENT TECHNOLOGIES J6802A and 
J6805A Distributed Network Analyzers. 

[0002] To meet this growing demand, operating system providers, such as MICROSOFT, 

provide embedded versions of their normal operating systems. One of the more recent offerings 
from MICROSOFT is WINDOWS XP EMBEDDED (referred to herein as XPE). Embedded 
systems, such as XPE, provide functionality in keeping with the nature of embedded systems, 
including mechanisms (such as write filters) for protecting critical data, such as the operating 
system or files from being corrupted. 

[0003] Embedded systems, much like personal computer systems generally store data in 

memory and/or mass storage. Mass storage may comprise, for example, a variety of persistent 
components, including removable and non-removable storage drives such as hard drives and 
compact flash media. Memory is largely comprised of non-persistent components, such as 
RAM. The data stored in memory is subject to corruption due to power surges, hard power 
downs, viruses, and. so on.. Even though embedded system have functionality, such as write 
filters, that seek to prevent corruption, they are not always effective. 

[0004] Corruption can lead to intermittent failures that may compromise the operation of the 
system, in test and measurement systems, corruption can lead to erroneous test results, incorrect 
diagnosis and unnecessary repairs and troubleshooting. Typically, the sooner corruption is 
detected, the lighter the potential damaged caused by the corruption. Many times corrupted data 
may be cleared from non-persistent components by simply. restarting the affected program or by 
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rebooting. Accordingly, it is desirable that the various programs running on an embedded 
system be monitored to ensure that remedial action can be taken as soon as possible. 

[0005] In many non-embedded systems, a video display, e.g. a monitor, is provided to facilitate 
monitoring the operation of the system. By watching the monitor it is possible to detect or 
predict some error that occur due to corruption. Further, the operating system can be configured 
to create a display on the monitor warning of error conditions caused by corruption or other 
factors. 

[0006J R has become popular to use embedded systems without any form of display, monitor or 
otherwise. Such embedded systems are often referred to as "headless" systems. Interaction with 
headless systems, if any, is usually via a communication channel, such as the Internet. An 
example of a headless system is a distributed test device co-located with network components or 
other convenient access point. Distributed test devices monitor the component or access point 
and report the results of the monitoring using the network being monitored or some other 
communication channel. 

[0007] Human involvement with headless test system is optimally limited to installation and 
activation of the system. In general, it is the goal of most embedded systems to be totally 
autonomous. It is thus desirable that an embedded system not only be reliable, but also self- 
monitoring. To achieve the goal of being both reliable and self-monitoring, embedded systems, 
headless or otherwise, should be programmed for the detection of conditions, such as corruption, 
that can cause the system to operate in a way other than intended. One known method is the use 
of watchdogs which track the execution of processes and execute some form of action if the 
watched process should fail. Failure modes include process crashes due to faulty code 
execution; hanging (failing to progress in a useful way); and deadlocking (stopping execution 
due to resources being unavailable). 

[0008] Most known embedded system watchdogs are hardware based and track a single process. 
The action on failure is typically a reboot. Hardware based watchdogs typically operate by 
monitoring a specified memory or register location. The location is assigned to the process, 
which is modified to update the location on a regular basis, termed "checking-in." If the 
location is not updated on schedule, the watchdog initiates a reboot. 
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[0009] Unfortunately, hardware based watchdogs only track a single process and, due to their 
nature, are not available on all computers. Further, the action upon failure is a simple reboot. 
Accordingly, the present inventors have recognized a need for apparatus and methods for 
tracking multiple processes and providing flexible actions upon failure. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0010] An understanding of the present invention can be gained from the following detailed 
description of the invention, taken in conjunction with the accompanying drawings of which: 

[0011] FIG. 1 is a block diagram of an embedded system in accordance with an embodiment of 
the present invention. 

[0012] FIG. 2 is a flow chart of a method in accordance with an embodiment of the present 
invention. 

[0013] FIG. 3 is a flow chart of a method in accordance with an embodiment of the present 
invention. 
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DETAILED DESCRIPTION 

[0014] Reference will now be made in detail to the present invention, examples of which are 
illustrated in the accompanying drawings, wherein like reference numerals refer to like elements 
throughout. In addition to the described apparatus, the detailed description which follows 
presents methods that may be embodied by routines and symbolic representations of operations 
of data bits within a computer readable medium, associated processors, embedded systems, 
general purpose personal computers and the like. The methods presented herein are sequences of 
steps or actions, often executed by a processor or dedicated circuits, leading to a desired result, 
and, as such, encompasses such terms of art as "software," "routines," "computer programs," 
"programs," "objects," "functions," "subroutines," and "procedures." These descriptions and 
representations are the means used by those skilled in the art effectively convey the substance of 
their work to others skilled in the art. 

[0015] The methods of the present invention will be described with respect to implementation 
on a headless embedded computer system using an embedded operating system. Those of 
ordinary skill in the art will recognize that the apparatus and methods recited herein may also be 
implemented on embedded systems with monitors and even on general purpose computing 
devices, with or without monitors. While the present invention will be described as being 
implemented using PC based devices operating with the MICROSOFT WINDOWS XP 
EMBEDDED (hereinafter XPE), the apparatus and methods presented herein are not inherently 
related to any particular device or operating system. Rather, various devices and operating 
systems may be used in accordance with the teachings herein. Machines that may perform the 
functions of the present invention include those manufactured by such companies as AGILENT 
TECHNOLOGIES and HEWLETT PACKARD as well as other manufacturers of embedded 
systems and general computing devices. 

[0016] FIG. 1 is a block diagram of an embedded system 100 in accordance with an 

embodiment of the present invention. The embedded system 100 generally comprises: a CPU 
1 10 connected by a bus. 1 12 to: RAM 1 14; disk storage 1 16; DMA (direct memory access) 
controller 118; timers 120; and an I/O subsystem 122. The disk storage 116 stores the operating 
system along with programs and data and may be divided into a plurality of partitions. The 
embedded system 100 shown in FIG. I lacks a monitor and, as such, is a headless system. 
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[0017] An indicator 124, connected to the I/O subsystem 122, provides an indication that the 
embedded system is operational in addition to being ON. Preferably, but not necessarily, this 
indication takes the form of a two color light emitting diode wherein one color is illuminated 
when certain conditions, generally reflective of an operational system, are present and a second 
color with the conditions are not present - generally indicating a non-operational system. A 
suitable indicator and control circuitry therefore is described in co-pending United States Patent 
Application Number 10/726,769 entitled APPARATUS AND METHOD FOR INDICATING 
SYSTEM STATUS IN AN EMBEDDED SYSTEM. The '769 applications is assigned to the 
assignee of the present application and incorporated herein by reference. 

[0018] It is to be noted that the block diagram shown in FIG. 1 has been simplified to avoid 
obscuring the present invention. Functional components have been left out or conveniently 
combined with other functional components selected for inclusion in FIG. 1 . Further, the block 
diagram shown in FIG. 1 is but one of many architectures upon which the present invention may 
be practiced. Specifically, the architecture show in FIG. 1 is sometimes termed the "PC 
architecture' 5 because it resembles an early personal computer. This architecture was chosen for 
describing the present invention, as it is universally recognizable to those of ordinary skill in 
embedded system design. 

[0019] The present invention generally comprises a software-based watchdog comprising two 
parts: a registration procedure; and a watchdog program. The registration procedure facilitates 
registering programs to be watched (the "watched programs," "registered programs," or 
"identified programs") and the performing of confirmation actions by the watched programs. 
The watchdog procedure checks for the periodic completion of the confirmation actions by the 
watched programs and, upon the failure to complete a confirmation action executes defined 
remedial actions. In accordance with at least one preferred embodiment implemented using 
XPE, the registration procedure is provided in a dynamic linked library file (dll) while the 
watchdog procedure is embodied by a service. 

[0020] In accordance with perhaps the preferred embodiment, the registration procedure 
provides for at least three functions: register; check-in; and unregister. The register function 
permits a program to register with the watchdog and pass various parameters including how 
often the watchdog should expect the program to check in (the delta time) and what action or 
actions are to be taken if the program fails to check in (the remedial action(s)). The check-in 
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function simply passes a timfestamp to a common memory location to serve as the confirmation 
action. The unregister function removes the program from the watchdog's list of programs 
being watched. 

[0021] The watchdog procedure periodically walks through the list of registered programs and, 
for those programs requiring service (as defined by the delta time) checks the memory location 
for the timestamp recorded by the check-in function. When the timestamp plus the delta time is 
less than the current time, it is deemed that a failure has occurred. The watchdog procedure logs 
the failure and takes a specified remedial action. 

[0022] In one embodiment, remedial actions comprise executable files that are called by the 
watchdog procedure. In perhaps the preferred embodiment, the register function passes the 
watchdog procedure an array designating a series of executable files. It is also possible to 
simply pass a pointer to the array. The watchdog procedure maintains a counter, which can be 
implemented as part of the pointer, which is incremented each time a registered program fails a 
check. Upon the detection of a failure, an executable, file from the array, selected using the 
counter, is executed. For example, upon the first failure the first executable file in the array is 
executed; upon the second failure the second executable file in the array is executed; etc... This 
allows different remedial actions to be taken each sequential time the registered program fails. 
For example: the first failure could result in the program being restarted; the second failure 
could result in a system restart; and the third failure could result in a system shutdown. 

[0023] FIG. 2 is a flow chart of a method in accordance with an embodiment of the present 
invention. More specifically, FIG. 2 is a method implementing one embodiment of the 
registration procedure. As noted, the registration procedure may be implemented as a dynamic 
linked library when the selected operating system is XPE. 

[0024] The method starts in step 200 when a program (application, system or otherwise) seeking 
service calls the routines of the present method. In accordance with at least one embodiment, 
routines embodying the present method are contained in a common dynamic linked library with 
offers three services: Register; Check-in; and Unregister. In step 202 a determination is made as 
to which of the three services is being requested. 
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[0025] When registration service is requested, the method proceeds to step 204 wherein the 
program requesting registration is identified. Subsequently, in step 206, a delta time is 
identified. The delta time is the maximum allowable time allowed for the program to check-in. 
Next in step 208, a remedial action list is identified and a pointer to the list, termed the error 
pointer EPn, is initialized to zero. 

[0026] In at least the present embodiment, the remedial action list is an array containing 

identifiers of executables, one of which is to be executed each time the program fails to check-in 
within the delta time. Each time a remedial action is needed, the error pointer is increased by 
one and the next executable in the remedial action list is executed. It is envisioned that the 
executable will instruct the system to take increasingly sever measures. For example the first 
action could comprise shutting down and restarting the program. The second and third actions 
could simply be restarting the entire system, while the fourth action could be shutting the system 
down for maintenance. 

[0027] In step 2 10, a registration entry is generated. In perhaps the preferred embodiment, the 
registration entry is a key in the XPE registry. The key could contain, for example, the program 
name, the delta time, the location of the remedial action list and a pointer into the list. In 
general, the information required to create the registration entry can be passed to the routine by 
the calling program (thereby constituting the steps of identifying). It may also be beneficial at 
this time to add a time stamp to the registration entry as a first check-in service. Alternatively, a 
request for check-in service can be issued immediately upon completion of the register service. 
The registration service thereafter ends in step 218 and the program initiating the registration 
service is now a watched program. 

[0028] If, back in step 202, check-in service was requested by a watched program, the method 
proceeds to step 212 and the current system time is retrieved. Subsequently, in step 214, a time 
stamp is written to a location associated with the registration entry. Each time the check-in 
service is called, the time stamp is over written with the then current time. The watched 
program should be programmed to repeatedly call the check-in service within the delta time. 
The check-in service thereafter ends in step 2 1 8. 



[0029] It; back in step 202, the remove registration service was requested, the method proceeds 
to step 216, and the registration entry is removed. In the present embodiment, this is 
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accomplished by simply deleting the registry key in XPE. The remove registration service 
thereafter ends in step 218 and the program requesting the remove registration service is no 
longer a watched program. 

[0030] FIG. 3 is a flow chart of a method in accordance with an embodiment of the present 
invention. More specifically, FIG. 3 is a method implementing an embodiment of the watchdog 
program. As noted, the watchdog program may be implemented as a service when the selected 
operating system is XPE. 

[0031] The method starts in step 300 upon being invoked, preferably as part of a startup routine. 
In step 302, a list of programs currently registered, for example using the registration procedure 
described with respect to FIG. 2, is obtained. In the case of the embodiment described with 
respect to FIG. 2, the list of registered program can be retrieved using standard XPE registry 
commands. Once the list has been retrieved, or at least a pointer has been created and set at the 
start of the list, each entry in the list is checked in a loop comprising steps 304 through 310. 

[0032] In step 304, the time stamp (TS), delta, and error pointer (EPn) of next entry (the first 
entry if this is the first pass) is retrieved. Next, in step 306 the current time (CT) is retrieved. In 
step 308 the sum of the time stamp associated with the entry and the delta associated with the 
entry is compared to the current time. If the sum is greater than the current time, the program is 
deemed to be operating correctly and the method goes to step 3 10 to check if there are more 
watched programs to check. If unchecked watched programs remain, the method returns to step 
304 and the time stamp (TS), delta, and error pointer (EPn) of next entry is retrieved. 

[0033] If in step 308, the sum of the time stamp and the delta is less than the current time, the 
watched program has failed to timely request a check-in and' the method proceeds to step 312. 
In step 312, the error pointer for the subject watched program (EPn) is incremented by 1 . Next 
in step 314, the remedial action pointed to by the error pointer is undertaken. Optionally, a log 
of the failure and the remedial action can be made. Upon completion of the remedial action, the 
method goes to step 3 10 and to check if there are more watched programs to check. If any 
watched programs remain to be checked the method returns to step 304 and the time stamp (TS), 
delta, and error pointer (EPn) of next entry is retrieved. 
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[0034] Once all watched program have been checked, the method proceeds to step 316 

where the methods waits for a prescribed interval prior to returning to step 302 to recheck the 
watched programs. 

[0035] It will be appreciated by those skilled in the art that changes may be made to the 

above described embodiment without departing from the principles and spirit of the invention, 
the scope of which is.defined in the claims and their equivalents. For example, the error pointer 
(EPn) can be reset after the expiration of a certain period of time, for example once a day. 
Alternatively, one for every X(e.g., 100) successful and uninterrupted check-ins the error 
pointer could be reduced. Optionally, any reduction or resetting of the counter can be logged. 

[0036] By way of further example, Tables 1 through 4 present source code, compatible with 

XPE, implementing certain features of the present invention. In this implementation, the only 
remedial action is a system restart. Further, this implementation uses a LED indicator on the 
embedded system to communicate system status with an operator. One implementation of an 
LED indicator is discussed in co-pending United States Patent Application Number 10/726,769 
incorporated herein by reference. 

[0037] TABLE' 1 : AgentWatchDogCommon.h 



#if [defined ( AFX_AGENTWATCHDOGCOMMON_H) 
ttdefine' AFX_AGENTWATCHDOGCOMMON_H 

#include "TCHAR. H" 
#include <time.h> 

#define MAXAWDLOG 50000 

const TCHAR WATCHDOG [ ] = _T ( "SOFTWAREWAgilent WAgentWatchDog" ) ; 
const TCHAR TIME STAMP { ] = _T { "Last Stamp" ) ; 

const TCHAR TIMEDELTA [ j = _T (" Delta ") ; 

const TCHAR REBOOTRETRY [ ] = _T ( "MaxRebootRetry " ) ; 

#define AWDMaxAppID 80 

#endif //! defined (AFX AGENTWATCHDOGCQMMON H) 



[0038] TABLE 2: AgentWatchDogDli.cpp 



// AgentWatchDogDli.cpp : Defines the entry point for the DLL application. 
#include "stdafx.h" 

#include " AgentWatchDogDll . h" • 
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// register should be called once at the start of your application 

// strAppID is any unique identifier for your application (typically its 

name) 

// nDelta is how long in minutes the wtchdog should wait for your application 
// to call AWDTimestamp before deciding the app is dead and rebooting the box 
AGENTWATCHDOGDLL_API UINT AWDRegs iter ( char * strAppID, UINT nDelta, UINT 
nRetryReboot ) { 

if (strlen (strAppID) +1 >.= AWDMaxAppID) 
strAppID[AWDMaxAppID-l] = NULL; 



UINT nReturnCode = AWDSuccess; 

HKEY hkTheKey; 
HKEY hkWDKey; 

DWORD dwDisposition; 
DWORD dwResult; 

DWORD nDeltaSeconds = nDelta*60; 

time_t currentTime; 
time ( &currentTime) ; 
DWORD dwTime = currentTime; 

dwResult = RegCreateKeyEx ( HKEY_LOCAL_MACHINE, WATCHDOG, 0 , REG NONE , 
REG_OPTION_VOLATILE, " 

KEY_WRITE|KEY_READ, NULL, ShkTheKey, 

SdwDisposition) ; 

if ( dwResult == ERROR_SUCCESS •) { 

dwResult = RegCreateKeyEx ( hkTheKey, strAppID, 0,REG_NONE, 
REG_OPTION_VOLAT I LE , 

KEY_WRITE|KEY_READ, NULL, &hkWDKey, 

SdwDisposition) ; 



if ( dwResult == ERROR_SUCCESS ) { 

if ( dwDisposition == REG_OPENED_EX I ST I NG_KE Y ) 
nReturnCode = AWDAlreadyRegistered; 



if ( RegSetValueEx ( hkWDKey 
(CONST BYTE* ) &nDel taSeconds ^ sizeof ( DWORD) 

!= ERROR_SUCCESS ) { 
// error message 
nReturnCode = AWDFailed 

} , 

if ( RegSetValueEx ( hkWDKey 
(CONST BYTE*) &dwTime, si zeof ( DWORD) ) 

!= ERROR_SUCCESS ) { 
// error message 
nReturnCode = AWDFailed 
} • • 

if ( RegSetValueEx ( .hkWDKey 
(CONST BYTE*) &nRetryReboot, sizeof (UINT) ) 

!= ERROR_SUCCESS ) { 
// error message 
nReturnCode = AWDFailed 

} 

RegCloseKey (hkWDKey) ; 



TIMEDELTA, NULL, REG_DWORD, 



TIMESTAMP, NULL, REG_DWORD, 



REBOOTRETRY, NULL, REG DWORD, 



) else ' { 
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// error message 
nReturnCode = AWDFailed; 

.} 

RegCloseKey (hkTheKey) ; 
} else { 

// error message 
nReturnCode = AWDFailed; 

} 

return nReturnCode; 

} 

// Unregister should be called if your application exits normally and does 
// not want the watchdog to reboot the box because your app ended 
AGENTWATCHDOGDLL_API UINT AWDUnregs i ter (char* strAppID) { 

if (strlen (strAppID) +1 >= AWDMaxAppID) 
StrAppID [AWDMaxAppID- 1] = NULL; 

UINT nReturnCode = AWDSuccess; 

HKEY hkTheKey; 
DWORD dwDisposition; 
DWORD dwResult; 

dwResult = RegCreateKeyEx( H KE Y_LOC AL_M ACH I N E , WATCHDOG, 0, REG NONE 
REG_OPT I ON_VOLAT I LE , ' " 

KEY_WRITE|KEY_READ, NULL, &hkTheKey, 

&dw Disposition) ; 

if( dwResult == ERROR_SUCCESS ) { 

if ( dwDisposition == REG_OPENED_EXISTING_KEY ) { 

if (RegDeleteKey (hkTheKey, strAppID) !- ERROR_SUCCESS ) 
nReturnCode - AWDNotRegis tered; 

} else { 

nReturnCode = AWDNotRegis tered; 

} 

RegCloseKey (hkTheKey) ; 
} else { 

// error message 
nReturnCode = AWDFailed; 

} 

return nReturnCode; 

} • ' 

// Timestamp should be called at least every nDelta minutes to keep the 
/ / watchdog from deciding your app has gone awol 
AGENTWATCHDOGDLL_API UINT AWDTimeStamp ( char * StrAppID) { 

if (strlen (strAppID) +1 >= AWDMaxAppID) 
strAppID[AWDMaxAppID-l] = NULL; 

UINT nReturnCode = AWDSuccess; 

HKEY hkTheKey; 
HKEY hkWDKey; 
DWORD dwDisposition; 
DWORD dwResult; 

• time_t currentTime; 
time ( ScurrentTime ) ; 
DWORD dwTime = CurrentTime; 

dwResult = RegCreateKeyEx( HKEY_LOCAL_MACHINE, WATCHDOG, 0 , REG NONE 
REG_OPTION_VOLATILE, ~ 

i KEY WRITE | KEY READ, NULL, &hkTheKey, 
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&dwDisposition) ; 

if( dwResult == ERROR__SUCCESS ) { 

if { dwDisposition == REG_OPENED_EXISTING_KEY ) { 

if( RegOpenKeyEx (hkTheKey, strAppTD, 0L, KE Y_SE TUVALU E, &hkWDKey) 
== ERROR_SUCCESS) { 

if {RegSetValueEx ( hkWDKey, TIMESTAMP, NULL, REG_DWORD, 
(CONST BYTE* ) &dwTime, sizeof ( DWORD) ) ) 

nReturnCode = AWDFailed; 
RegCloseKey (hkWDKey) ; 
} else { 

nReturnCode = AWDNotRegis tered; 

} 

} else { 

nReturnCode = AWDNotRegis tered; 

} 

RegCloseKey (hkTheKey) ; . 
} else { 

// error message 
nReturnCode = AWDFailed; 

} 

return nReturnCode; 

} 



[0039] TABLE 3: AgentWatchDogDli.h 



// The following ifdef block is the standard way of creating macros which 
make exporting 

// from a DLL simpler. All files within this DLL are compiled with the 
AGENTWATCHDOGDLL_EXPORTS ' 

// symbol defined on the command line, this symbol should not be defined on 
any project 

// that uses this DLL. This way any other project whose source files include 
this file see 

// AGENTWATCHDOGDLL_API functions as being imported from a DLL, wheras this 
DLL sees symbols 

// defined with this macro as being exported. 
#ifdef AGENTWATCHDOGDLL_EX PORTS 

#define AGENTWATCHDOGDLL_API declspec (dllexport ) 

#else 

#define AGENTWATCHDOGDLL__API declspec (dllimport ) 

#endif 

tinclude " AgentWatchDogCommon . h" 

#define AWDSuccess 0; 
#define AWDNotRegistered 1; 
#define AWDAlreadyRegis tered 2; 
#define AWDFailed 3 

// register should be called once at the' start of your application 

// strAppID is any unique identifier for your application (typically its 

name) 

// nDelta .is how long in minutes the watchdog should wait for your 
application 

//to call AWDTimestamp before deciding the app is dead and rebooting the box 
// nReebootRetry is the max number of rebbots that should be done in a 24 
hour period 
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// before giving up 

// Return Codes: AWDSuccess, AWDAlreadyRegis tered, AWDFaiied 

AGENTWATCHDOGDLL_API UINT AWDRegs i ter ( char * strAppID, UINT nDelta = 5, UINT 
nRetryReboot - 5) ; . 

// Unregister should be called if your aplication exits normally and does 
// not want the watchdog to reboo the box because your app ended 
// Return Codes: AWDSuccess, AWDNotRegistered, AWDFaiied 
AGENTWATCHDOGDLL_API UINT AWDUnregsi ter ( char * strAppID); 

// Timestamp should be called at least every nDelta minutes to keep the 

/ / watchdog from deciding your app has gone awol 

// Return Codes: AWDSuccess, AWDNotRegistered, AWDFaiied 

AGENTWATCH DOGDLL_AP I UINT AWDTimeStamp ( char * strAppID); 



[0040] TABLE 4: AgentWatchDog.cpp 

// AgentWatchDog.cpp : Defines the entry point for the application. 
// 

#include "afx.h" 

#include "stdafx.h" 

#include "AgentWatchDogCommon . h " 

#include "AutoRunCommon . h" 

/* Prototypes of Inp and Outp used for LED control */ 

short _stdcall Inp32 (short PortAddress); 

void _stdcall Out32 (short PortAddress, short data); 



LPVOID GetSysErrorMsg (DWORD dwErrCode) 
{ 

// ■ 

// LocalFreeO must be used on the returned pointer to free the memory 

// allocated by FormatMessage ( ) 

// • 

LPVOID IpMsgBuf; 

FormatMessage ( 

FORMAT_MESSAGE_ALLOCATE_BUFFER | 
FORMAT_MESSAGE_FROM_SYSTEM | 
FORMAT_MESSAGE_IGNORE_INSERTS, 
NULL, 

dwErrCode, 

MAKE LANG I D ( LANG_NEUTRAL , SUBLANG_DEFAULT ) , //Default language 
- (LPTSTR) &lpMsgBuf, 
0, 

NULL 

) ; 

return IpMsgBuf; 

) 

void LogToFile (CString strLogData) { 
TRY { 

CString strLogPath; 
if (IsRackMount ( ) ) { 

DWORD nDataSize = 1-28; 
char strSystemDir [128] ; 
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HKEY hWFKey; 

// Attempt to get the directory info from the registry 
if( RegOpenKeyEx { HKEY_LOCAL_MACHINE , PLATFORM, 

0, KEY_READ, ShWFKey ) == ERROR_SUCCESS ) { 
// read the entry 

if ( RegQueryValueEx (hWFKey, SYSTEMDIR, 0, NULL, (unsigned 
char*) strSystemDir, 

&nDataSize) != ERROR_SUCCESS ) { 

} 

RegCloseKey (hWFKey) ; 
} else { 

strcpy (strSystemDir , S Y STEM D I RDE FAULT ) ;' 

} 

strLogPath = strSystemDir; 
} else{ 

// Get' executable directory 
char szPathName [_MAX_PATH] ; 
, GetModuleFileName ( NULL, szPathName, sizeof( szPathName ) ); 
char* cptr = strrchr ( szPathName, ' W ); 
if ( cptr ) 

*cptr = 0x0; 
strLogPath = szPathName; 
} 

strLogPath = strLogPath + " WAgentWatchDog . log" ; 

CFile logFile( strLogPath, 

CFile : :modeCreate | CFile :: modeNoTruncate | 
CFile : : modeReadWr ite j CFile : : shareDenyWr ite ) ; ' 

logFile . SeekToBegin { ) ; 

UINT nCurrentWriteOf f set = 0; 

UINT nBytesRead = logFile . Read ( &nCurrentWri teOf f set , 
sizeof (nCurrentWriteOf f set ) ) ; 

// if new file or need to loop; reset offset 
if ((nBytesRead < sizeof (nCurrentWriteOf f set) ) || 
(nCurrentWriteOf f set > MAXAWDLOG) ) { 
nCurrentWriteOffset = si zeof (nCurrentWriteOf f set ) ; 
logFile . SeekToBegin (■) ; 
logFile. Write ( SnCurrentWriteOf f set , 
sizeof (nCurrentWriteOffset) ) ; 
}. 

logFile. Seek( nCurrentWriteOffset, CFile :: begin ); 

CTime thetime = CTime : : GetCurrentTime ( ) ; 
CString strBuffer; 

strBuffer = thetime . Format (" %c" ) ; 
strBuffer += ": "; 
strBuffer += strLogData; 
strBuffer += 1 \r 1 ; 
strBuffer += 1 \n ' ; 



logFile. Write (strBuffer. GetBuffer (0) , strBuf f er . GetLength ( ) ) ; 
nCurrentWriteOffset += strBuf fer . GetLength () ; 
logFile . SeekToBegin () ; 

logFile. Write ( SnCurrentWriteOf f set , sizeof ( nCurrentWriteOf f set ) ) ; 
logFile. Flush ( ) ; 
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logFile . Close ( ) ; 

} 

CATCH (CFileException, e) ; 
END_CATCH 

} 



int APIENTRY WinMain (HINSTANCE hlnstance, 

HINSTANCE hPrevIns tance , 
LPSTR IpCmdLine, 
int nCmdShow) 

{ 

CString strRebootMsg; 
bool bKeepRunning = true; 

HKEY hWFKey; 
char strSystemDir [80] ; 

// Attempt to get the directory info from the registry 
DWORD dwResult = RegOpenkeyEx ( H KE Y_LOC AL_M ACH I N E , PLATFORM, 

0, KEY__READ, &hWFKey ); 

if ( dwResult == ERROR_SUCCESS) { 
DWORD nDataSize = 80; 
// read the entry 

if( RegQueryValueEx (hWFKey, SYSTEMDIR, 0, NULL, (unsigned 
char*) strSystemDir, 

&nDataSize) != ERROR_SUCCESS ) { 
strcpy(strSystemDir, SYSTEMDIRDE FAULT ) ; 

} 

RegCloseKey (hWFKey) ; 

} 

char strWatchDoglni [ 128]; 

sprintf (strWatchDoglni, "%s%s", strSystemDir, "\\AgentWatchDog.ini"); 

SYSTEMTIME sysTime; 

GetLocalTime ( &sysTime) ; 
int nLastHour = sysTime . wHour ; 

while (bKeepRunning) { ; : 

/ / do our thing about once a minute 
Sleep (1000*60)'; 

//if the clock . wrapped past midnight clear the file that tracks ■ 
reboot counts * 

GetLocalTime ( isysTime) ; 
int nHour = sysTime . wHour ; 

if (nHour != nLastHour) { // did hour change? 

if (nHour < nLastHour) { // did we wrap past midnight? 
// delete record of previous reboot counts 
DeleteFile (strWatchDoglni) ; 

} 

nLastHour = nHour; 

} 

// get current time 
time_t currentTime; 
time (&curre'ntTime) ; 
DWORD dwTime = currentTime; 

. //find executables 
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HKEY hkTheKey; - 

DWORD dwResult = RegOpenKeyEx ( HKEY_LOCAL_MACHINE, WATCHDOG , 0L, 

KEY_READjKEY_WRITE, &hkTheKey) ; 

if (dwResult == ERROR_SUCCESS) 
{ " " , ' 

FILETIME ft; 
int i=0; 

bool bMoreExecutables = true; 
while (bMoreExecutables && bKeepRunning ) 



I 



DWORD nLen = 80; 
char pChar [80] ; 



LONG IResult = RegEnumKeyEx (hkTheKey , i + +, 
pChar, &nLen, NULL, NULL, NULL, &ft); 

if (IResult == ERROR_SUCCESS) 

{ . 

HKEY hkLocalKey; 
DWORD dwLocalResult = RegOpenKeyEx (hkTheKey , pChar, 

0L, 

KEYJREAD, ShkLocalKey) ; 

DWORD dwDeltaResult; 

DWORD dwTimeStampResult; 
DWORD dwSize; 

if (dwLocalResult === ERROR_SUCCESS ) { 

DWORD dwDelta =0; 
DWORD dw Time St amp = 0; 
DWORD dwMaxReboot = 0; 

dwSize = sizeof ( DWORD) ; 

dwDeltaResult = RegQueryV.alueEx (hkLocalKey , 
TIMEDELTA, NULL, NULL, (LPBYTE) &dwDelta, &dwSize); 

dwSize = sizeof ( DWORD) ; 

dwTimeStampResult = RegQueryValueEx (hkLocalKey , 
TIMESTAMP, NULL, NULL, ( LPBYTE ) StdwTimeS tamp , SdwSize) ; 

dwSize = sizeof (DWORD) ; 

dwTimeStampResult = RegQueryValueEx (hkLocalKey , 
REBOOTRETRY, NULL, NULL, ( LPBYTE ) '&dwMaxReboot , SdwSize); 

// Test dwDeltaResult and dwTimeStampResult to be 

sure 

if ( dwDeltaResult ERROR_SUCCESS && 
dwTimeStampResult == ERROR__SUCCESS ) { 

Out-32 (888, 0) ; // Ties all of the data bits low 
(off) this is need to clear the other LED color 

Out32 (888, 4) ; // Turns the Bit on registar 4 

High (on) = Red LED 

// 1st check if 

if (dwTimeStamp + dwDelta < dwTime) { 
char strRebootCount [10] ; 

GetPrivateProf ileString ( "RebootCounts " , 

pChar, 

"0", 



strRebootCount, 



10, 
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strWatchDoglni ) 
dwMaxReboot ) { 

10); 

"RebootCounts" , 
NULL , 

strWatchDoglni ) 
"RebootCounts" , 

strRebootCount , 
strWatchDoglni ) 



DWORD nRebootCount = atoi ( strRebootCount ) ; 
if (dwMaxReboot == 0 | I nRebootCount < 



if (dwMaxReboot != 0) { 

ltoa (nRebootCount+1 , strRebootCount, 

// clear the old record from the file 
WritePrivateProf ileString ( 

pChar, 



WritePrivateProf ileString ( 



pChar , 



LastStampTime . Format ( "%c" ) ; 

time st amp; Last Times tamp was at 



} 



} 



// Log failure to file 
CTime LastStampTime ( ( time_t ) dwTimeStamp) ; 
CString s trTimeBuf f er ; 
strTimeBuf f er = 

strRebootMsg . Format ( "%s failed to 
' , pChar ) ; 

strRebootMsg += strTimeBuf fer ; 
LogToFile (strRebootMsg) ; 
bKeepRunning = false; 



} 



entry", pChar) 



} else { 

CString strMsg; 

strMsg. Format ("%s has a malformed registry 
LogToFile (strMsg) ; 



} else { 



bMcreExecu tables = false; 



} 



RegCloseKey (hkTheKey) ; 



} // while (true) ; 

//we only get here if some process didn't timestamp appropriately and we 
need to re-boot 

HANDLE hMyToken; ! 
TOKEN_PRIVILEGES tp; 
LUID luid; 

/ / open the token for our porcess 
if ! !QpenProcessToken ( GetCurrentProcesS ( ) , TOKEN ADJUST PRIVILEGES, 
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ShMyToken) ) { - 
LogToFile ( "Failed to OpenProcessTokens") ; 

} 

// lookup the LUID for the S E_S HUT DOW N_N AME privilege, 
if ( ! LookupPrivilegeValue (NULL, SE_SHUTDOWN_NAME , &luid) ) { 
LogToFile ( "Failed LookupPrivilegeValue"); 

} 

// setup to give ourselves the SE_SHUTDOWN_NAME privilege 
tp. PrivilegeCount ^ 1,- 
tp . Privileges 1 0] . Luid = luid; 

tp. Privileges [0] .Attributes = SE_PRIVILEGE_ENABLED; . 
if( ! AdjustTokenPrivileges (hMyToken, FALSE , &tp, 
sizeof (TOKEN_PRIVILEGES) , NULL, NULL) ) { 

LogToFile ( "Failed AdjustTokenPrivileges") ; 

} 

// check if it worked 

if (GetLastError ( ) != ERROR__SUCCESS ) { 
// LogToFile 

} 

// Reboot! 

LogToFile ("Reboot started")/ 
int i = 0; 

while (.! InitiateSystemShutdown(NULL / (char*) { LPCTSTR) s trRebootMsg, 3, 
true, true) && i < 20 ) 

. { 

LogToFile( "AgentWatchDog - failed Initiali zeSystemShut Down" ) ; 

LPVOID pErrMsg = GetSysErrorMsg (GetLastError <)) ; 

LogToFile ( (char*) pErrMsg) ; 

LocalFree (pErrMsg) ; 

Sleep (15000) ; 

i++; 

} 

Sleep (30000) ; 
i = 0; 

while ( !ExitWindowsEx(EWX_REBOOT J EWX FORCE, 0) && i <'20) 
{ " 

LogToFile ( "AgentWatchDog - failed InitializeSystemShutDown" ) ; 

LPVOID pErrMsg = GetSysErrorMsg (GetLastError ()) ; 

LogToFile ( (char* ) pErrMsg) / 

LocalFree (pErrMsg) ; 

Sleep(15000) ; 

i + +; 

} 

// if we haven't rebooted by now, give up 
return 0; 



[0041] Although a couple embodiments of the present invention have been shown and 

described, it will be appreciated by those skilled in the art that changes may be made in these 
embodiments without departing from the principles and spirit of the invention, the scope of 
which is defined in the claims and their equivalents. 
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